Speaker identification using minimum classification error training

نویسندگان

Olivier Siohan

Aaron E. Rosenberg

Sarangarajan Parthasarathy

چکیده

In this paper we use a Minimum Classification Error (MCE) training paradigm to build a speaker identification system. The training is optimized at the string level for a text-dependent speaker identification task. Experiments performed on a small set speaker identification task show that MCE training can reduce closed-set identification errors by up to 20-25% over a baseline system trained using Maximum Likelihood Estimation. Further experiments suggest that additional improvement can be obtained by using some additional training data from speakers outside the set of registered speakers, leading to an overall reduction of the closed-set identification errors by about 35%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution

In our previous work, we have proposed a speaker modeling technique using spectral and pitch features for text-independent speaker identification based on Multi-Space Probability Distribution Gaussian Mixture Models (MSD-GMMs). We have presented a maximum likelihood (ML) estimation procedure for the MSD-GMM parameters and demonstrated its high recognition performance. In this paper, we describe...

متن کامل

Minimum classification error/eigenvoices training for speaker identification

This paper describes a new training approach based on two different techniques (Minimum Classification Error and eigenvoices) in order to achieve a better robustness when only poor training data is provided. In the first two sections of this paper we describe the MCE training and the eigenvoice approach. Then a unified MCE/eigenvoice training algorithm is proposed describing theoretical advanta...

متن کامل

Incremental Speaker Adaptation with Minimum Error Discriminative Training for Speaker Identification

Minimum Classification Error (MCE) has shown to be effective in improving the performance of a speaker identification system [1]. However, there are still problems to solve, such as the variability of the voice characteristics of a particular speaker through time. In this work, we analyze the degradation of a GMM-based textindependent speaker identification system when using test data recorded ...

متن کامل

Incremental speaker adaptation with minimum error discriminative training for speaker identification

متن کامل

Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition

Hidden conditional random fields (HCRFs) are derived from the theory of conditional random fields with hidden-state probabilistic framework. It directly models the conditional probability of a label sequence given observations. Compared to hidden Markov models, HCRFs provide a number of benefits in the acoustic modeling of speech signals. Prior works for training on HCRFs were accomplished with...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Speaker identification using minimum classification error training

نویسندگان

چکیده

منابع مشابه

Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution

Minimum classification error/eigenvoices training for speaker identification

Incremental Speaker Adaptation with Minimum Error Discriminative Training for Speaker Identification

Incremental speaker adaptation with minimum error discriminative training for speaker identification

Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition

عنوان ژورنال:

اشتراک گذاری